Vine parsing augmented Italian treebanks
نویسندگان
چکیده
This brief article describes our contribution to the EVALITA 2009 Parsing Task, dependency track. The TUT and ISST treebanks are augmented with additional features. MIRA is used to find a weight matrix suited for the Covington algorithm, which is subsequently skewed by discriminatively learned hard constraints on dependency lengths. Our skewed algorithm is linear time and thus asymptotically faster than the cubic time Covington algorithm, but increases in performance are insignificant. Our overall system is thus non-competitive. It is shown, however, that it does contribute significantly to the performance of a more competitive ensemble-based system.
منابع مشابه
Training Parsers on Incompatible Treebanks
We consider the problem of training a statistical parser in the situation when there are multiple treebanks available, and these treebanks are annotated according to different linguistic conventions. To address this problem, we present two simple adaptation methods: the first method is based on the idea of using a shared feature representation when parsing multiple treebanks, and the second met...
متن کاملStatistical Dependency Parsing of Four Treebanks
Multilingual dependency parsing is gaining popularity in recent years for several reasons. Dependency structures are more adequate for languages with freer word order than the traditional constituency notion. There is a growing availability of dependency treebanks for new languages. Broad coverage statistical dependency parsers are available and easily portable to new languages. Dependency pars...
متن کاملComparing the Influence of Different Treebank Annotations on Dependency Parsing
As the interest of the NLP community grows to develop several treebanks also for languages other than English, we observe efforts towards evaluating the impact of different annotation strategies used to represent particular languages or with reference to particular tasks. This paper contributes to the debate on the influence of resources used for the training and development on the performance ...
متن کاملMeasuring Parsing Difficulty Across Treebanks
One of the main difficulties in statistical parsing is associated with the task of choosing the correct parse tree for the input sentence, among all possible parse trees allowed by the adopted grammar model. While this difficulty is usually evaluated by means of empirical performance measures, such as labeled precision and recall, several theoretical measures have also been proposed in the lite...
متن کاملDependency And Relational Structure In Treebank Annotation
Among the variety of proposals currently making the dependency perspective on grammar more concrete, there are several treebanks whose annotation exploits some form of Relational Structure that we can consider a generalization of the fundamental idea of dependency at various degrees and with reference to different types of linguistic knowledge. The paper describes the Relational Structure as th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009